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Abstract. Two-dimensional bootstrap percolation is a cellular automaton in which 
sites become 'infected' by contact with two or more already infected nearest neighbours. 
We consider these dynamics, which can be interpreted as a monotone version of the Ising 
model, on an n x n square, with sites initially infected independently with probability p. 
The critical probability p c is the smallest p for which the probability that the entire 
square is eventually infected exceeds 1/2. Holroyd determined the sharp first-order 
approximation: p c ~ 7r 2 /(181ogn) as n — > oo. Here we sharpen this result, proving 
that the second term in the expansion is — (log n)~ 3 ^ 2+ °^ 1 \ and moreover determining it 
up to a poly(loglogn)-factor. The exponent —3/2 corrects numerical predictions from 
the physics literature. 



I. Introduction 

Bootstrap percolation is a cellular automaton in which, given a (typically random) 
initial set of 'infected' vertices in a graph G, new vertices are infected at each time step 
if they have at least r infected neighbours. In this paper we shall study two-neighbour 
bootstrap percolation on the square grid [n] 2 . We shall determine the second term of 
the critical threshold for percolation up to a poly(loglogn)-factor, and hence confirm a 
conjecture of Gravner and Holroyd [29] . 

We begin by defining the bootstrap process, which was introduced by Chalupa, Leath 
and Reich [19] in 1979. Let G be a graph with vertex set V(G), and for each vertex 
v G V(G), let N(v) denote the set of neighbours of v in G. Now, given an integer r G N, 
and a set of initially infected vertices A C V(G), define A t recursively as follows: set 
A = A, and 

A t+1 = A t U {v g V(G) : \N(v) n A t \ > r} 

for each integer t ^ 0. We say that the vertices of A t have been infected by time t. Let 
[A] = |J t A t denote the closure of A under the r-neighbour bootstrap process, and say 
that the set A percolates if the entire vertex set is eventually infected, i.e., if [A] = V(G). 

We shall be interested in the case in which A is a random subset of V(G). More 
precisely, let us choose the elements of A independently at random, each with probability 
p, and denote by ¥ p the corresponding probability measure. Throughout the paper, A 
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will be assumed to be a random subset selected according to this distribution, unless 
otherwise stated. It is clear that the probability of percolation is increasing in p, and so 
we define the critical probability, p c (G,r) as follows: 



Our aim is to give sharp bounds on p c (G, r). 

Bootstrap percolation has been studied extensively by mathematicians [2j El [16j ETJ HO] , 
as well as by physicists [HEEDED] an d sociologists [2S1SI], amongst others. The bootstrap 
process was originally introduced in the context of disordered magnetic systems, and may 
be thought of as a monotone version of the Glauber dynamics of the Ising model. Indeed, 
if spins are allowed to flip in only one direction (from — to +, say), and if they flip only 
if they have at least r neighbours in state +, then one immediately obtains the cellular 
automaton described above. We refer the interested reader to the applications of bootstrap 
percolation in [23 [38], and the closely related models studied in [TBI 1201 I2T1 [271 1361 139] . 

We focus on the graph G = [n] d with vertex set {1, . . . , n} d , and with an edge between 
vertices u and v if and only if ||w — u||i = 1. Aizenman and Lebowitz [2] determined the 
asymptotic behaviour of p c ([n] d , 2) up to multiplicative constants, and Cerf and Cirillo [H] 
(in the crucial case d = r = 3) and Cerf and Manzo [T7] proved the corresponding result 
for all d ^ r ^ 2. The first sharp threshold for bootstrap percolation was proved by 
Holroyd [31], who showed that 



This was the first result of its type, and has prompted a flurry of generalizations. Sharp 
thresholds have since been determined for p c ([n] d , r) for all fixed d and r [6j [8], for more 
general update rules in two dimensions [22], and in high dimensions (i.e., d = d(n) — > oo 
sufficiently fast) for the majority rule, i.e., r = d [5], and in the case r = 2 [7J. Some of the 
techniques from these papers have been used to prove results about the Glauber dynamics 
of the Ising model [2U [38]. The bootstrap process has also been studied on infinite 
trees [9J [T3], [23] , on 'locally tree-like' graphs [5], on the random regular graph [TUJIM], and 
on the Erdos-Renyi random graph G HjP [35] . 

In this paper we shall study the two-neighbour bootstrap process on the graph G = [n] 2 
in more detail. One of the most striking facts about the result ([TJ stated above is that it 
contradicted estimates of lim p c \ogn given by simulations - in fact, such estimates were 



out by a factor of more than two. (See, for example, [2S] or [30] for a discussion of the 
reasons behind these discrepancies.) Gravner and Holroyd [2H] gave a rigorous (partial) 
explanation for this phenomenon, by giving the following improvement of ([!]): 



p c (G,r) := inf <p : P p (A percolates in the r-neighbour process on G) ^1/2 




(1) 



Pc([n] 2 ,2) <: 



c 



18 log 72 



(log n) 3 / 2 ' 
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where c > is a small constant. In [2H], the same authors proved an almost matching 
lower bound for a simpler model (called 'local' bootstrap percolation), and conjectured 
that the upper bound is essentially sharp for the usual bootstrap process. 

Conjecture 1 (Gravner and Holroyd [29]). For every e > 0, if n is sufficiently large then 

Pc([n]\2) ^ 



18 log n (logn) 3 / 2 6 

In this paper we shall prove Conjectured] in a slightly stronger form. To be precise, we 
shall prove the lower bound in the following theorem; the upper bound was proved in |28j . 

Theorem 1. There exist constants C > and c > such that 

c(loglogn)3 ^ p c (N 2 ,2) ^ n2 



18 log n (logn) 3 / 2 ' 18 log n (logn) 3 / 2 

for every sufficiently large n£N. 

Our result again corrects numerical predictions, this time for the power of logn in 
the second term. Specifically, in work reported in |T], Stauffer interpolated between 
simulations and the rigorous result ([1]) to obtain the prediction 

t\ 12 n \ ^ - 45 



18 logn (logn) 1 - 2 ' 

which is inconsistent with the lower bound in Theorem [1] in the limit n — > oo. Since 
3/2 > 1.2, the true asymptotic approach to the first approximation 7r 2 /(181ogn) is faster 
than in the above numerical prediction. Furthermore, in Section [5] we shall discuss how 
the proof of Theorem [1] can be adapted to a variant of bootstrap percolation called the 
Frobose model. In this case, the resulting power 3/2 of logn corrects the prediction 1.388 
made in [3D] on the basis of computer calculations for the local Frobose model. 

The proof of Theorem [TJ will use many of the tools and techniques of [3Tj . together 
with some of the ideas of [2H], arid some new ideas. In particular, we shall bound the 
probability of percolation by the expected number of 'good' and 'satisfied' hierarchies 
(see Lemma [TJ below). We will define a hierarchy as in [31] (see Section [3]), except that 
our hierarchies will be much finer, each step being of order 1/^/p, instead of 1/p. This 
means that we will have far too many hierarchies; however, almost all of these have many 
'large' seeds, and we shall show that these contribute a negligible amount to the sum. In 
order to do so, we shall need a better bound on the probability that a seed is internally 
spanned than the straightforward bound that sufficed in [31]. Fortunately, the bound we 
need follows easily from the simple (folklore) fact that a spanning set for a rectangle R 
must contain no fewer than <f)(R)/2 elements, where <fr(R) denotes the semi-perimeter of 
R (see Lemmas [2] and [3]) . Surprisingly, it appears that our proof does not extend directly 
to the "modified" bootstrap percolation model; it is the analogous bound for seeds that 
is missing in this case (see Section |5] for more information). 
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We finish this section by making a few definitions which we shall use throughout the 
proof. First, we say a set S is spanned by the set A if S C [A], and that S is internally 
spanned by A if S C [AflS]. An event is simply a collection .4. of subsets of [n] 2 ; we say 
A holds if A G A In order to keep our formulae relatively compact, we shall sometimes 
write I(S) for the event that S is internally spanned by A. 

Next, define two functions, {3 and g, by 

:= !i±V^M and ^) := _iog(/3(l- e - 2 )). 

We remark that /3 is increasing on [0,1], and so g is decreasing on (0, oo), and that 
g(z) ^ 2e~ z when z is large (see Proposition 3 of [5]). Note that ~ ^/m as u — > 0, 
and so g(z) ~ — log-^/i as z — > 0, where g(z) ~ means that g(z)/h(z) —> 1. 
A rectangle is a set of the form 

i? = [(a, 6), (c, d)] := { (x, y) : a < x ^ c, 6 ^ y ^ d) C Z 2 , 

where a,b,c,d G Z. The dimensions of R are dim(i?) = (c — a + 1, d — b + 1), the 
long and short side-lengths of i? are respectively sh(i?) = min{c — a + l,d — 6+1} and 
lg(-R) = max{c — a + 1, d — b + 1}, and the semi-perimeter of R is 0(-R) = sh(i?) + lg(-R). 
We say that a rectangle R = [(a, b), (c, d)} is crossed from left-to-right by A C i? if 

i? C [AU {(x,y) 6Z 2 :^d-l}], 

i.e., if i? is spanned by A together with the set of all sites to the left of R. Note that this 
is equivalent to there being no 'double gap' (i.e., no adjacent 'empty' pair of columns) in 
R, and the final column being occupied. (Here 'empty' means 'contains no element of A 1 
and 'occupied' means 'not empty'.) 

For each p G (0, 1), let q = — log(l — p), so that p ~ q as p — > 0. To motivate this 
definition (and the definition of g(z), above), note (from Lemma 8 of [31]) that for any 
rectangle R with dimensions (a, b), then 

F P (A crosses R from left-to-right) sC e " a9(fc9) . 

We shall use the notation /(x) = 0(/i(x)) throughout to mean that there exists an 
absolute constant C > 0, independent of all other variables (unless otherwise stated), 
such that /(x) ^ Ch(x) for all x = {x\, . . . , If the constant C depends on some other 
parameter y, then we shall write /(x) = O y (/i(x)). 

We shall write M + for the positive reals, and if a, b G M 2 , then we shall write a ^ b if 
di ^ by and 02 ^ bi- Thus a path in R 2 is 'increasing' if it is increasing in both coordinates. 
Finally, if G is a directed graph, then Nq(v) will denote the set of out-neighbours of a 
vertex v in G. 

The rest of the paper is organised as follows. In Section [2] we give an upper bound 
on the probability that a sufficiently small rectangle (a seed) is internally spanned. In 
Section [3] we recall from (31] the notion of a hierarchy, which is fundamental to the proof 
of Theorem [TJ together with some important lemmas from [29] and [31]. In Section H] we 
prove Theorem [H and in Section [5] we mention some open questions. 
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2. A LEMMA ON SEEDS 



In this section we shall prove the following lemma, which bounds the probability that 
a small rectangle is internally spanned. Recall that q = — log(l — p). 

Lemma 2. There exists 5 > such that, for any p > 0, and any rectangle R with 
dim(i?) = (a, b), where a ^ b and ap ^ 5 then 



We begin by recalling a lovely and well-known exercise for high school students (see [2] 
or [12], for example). Lemma |5] follows from it almost immediately. 

Lemma 3. If R is a rectangle, and A internally spans R, then \A n R\ ^ <fr(R)/2. 

We also make a simple observation. 

Observation 4. If z > is sufficiently small then 



Proof. We use the estimates z — z 2 ^ 1 — e 2 ^ z, and y/u ^ f3(u) ^ v^w + u, which are 
valid for small z and u. It follows that 

g(z) ^ -log j3(z) ^ -log(A/z + z) = — log yfz — log(l + \fz) ^ -logV^-v^- 



We can now easily deduce Lemma [2j 

Proof of Lemma® Let m = \A (1 R\. By Lemma EJ if A internally spans R then m ^ 
(a + b)/2. There are at most (^J ways to choose the set A D R, given m, and each occurs 
with probability at most p m . Thus, by the union bound, 



The final inequality follows since Q)p m ^ (^) m ^ (Qaq) m /2 if 5 > is sufficiently 
small, and since 6aq ^ 125 < 1/2. In the second inequality we used p ~ q and m ^ 6/2. 
But logfl/y/ag) ^ — aq, by Observation HI so 




log(l/v^) - < < log(l/v^) + 



The proof of the upper bound is similar. 



□ 





The result now follows, since aq ^ 25, and y/Qe 25 < 3 if 5 is sufficiently small. 



□ 
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3. Hierarchies 

In this section we shall recall some important definitions and lemmas from [29] and [31] ; 
for the proofs, we refer the reader to those papers. In particular, we define a hierarchy as 
in Section 9 of [31] . 

Definition. A hierarchy % for a rectangle R C [n] 2 is an oriented rooted tree G%, with all 
edges oriented away from the root ('downwards'), together with a collection of rectangles 
(Ru C [n] 2 : u G V(G%)), one for each vertex of G%, satisfying the following criteria. 

(a) The root of G^ corresponds to R. 

(b) Each vertex has at most 2 neighbours below it. 

(c) If v G N£ n (u) then R u D R v . 

(d) If N^Ju) = {v, w} then [R v U R w ] = R u . 

A vertex u with Nq (u) = is called a seed. Given two rectangles S C R, we write 
D(S,R) for the event (depending on the set (An R)\ S) that 

# = pu s) nR], 

i.e., the event that R is internally spanned by A U S. 

An event ^4 is increasing if A G ^4 and A C A implies that A' G Two increasing 
events B and C are said to occur disjointly if there exist disjoint sets B C A and Cci 
with B a B and C G C. We write B o C for the collection of such sets A, i.e., the event 
that B and C occur disjointly. We say a hierarchy occurs (or is satisfied by the set A) if 
the following events all occur disjointly. 

(e) For each seed u: R u is internally spanned by A. 

(/) For each pair (u,v) satisfying Nq (u) = {v}: D(R V ,R U ) occurs. 

Given two rectangles S C R, with dimensions (01,02) and (61,62) respectively, define 

dj(S,R) := ^-^ 

for j = 1, 2, and let d(5", i?) = max{di(S, R), d 2 (S, R)}. 

The following definition is slightly different to that in [3T] , and is motivated by the 
method of [29] (see also Lemma [9] below). This definition is necessary because in order 
to prove a sharper result, we need to take a finer hierarchy. In our application we shall 
take T = y/q and Z = log 3 (l/g)/ v /g. 

Definition. A hierarchy is good for (T, Z) G 1R^_ if is satisfies the following. 

(g) UN£ H {u) = {v} and \N^(v)\ = 1 then T < d(R v ,R u ) < 2T. 
(/i) If JV^(u) = {v} and |JV^(u)| 7^ 1 then d(R v ,R u ) ^ IT. 
{%) If |JV^(u)| = 2 and v G ^(u), then d(R v ,R u ) ^ T. 
(j) u is a seed if, and only if, sh{R u ) ^ Z. 

Before continuing, we make a simple observation about the height, h(7i) of a hierarchy 
by which we mean the maximum distance in Gu of a leaf from the root. 
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Lemma 5. Let R be a rectangle, let Z > 1 > T > 0, and let H be a hierarchy for R 
which is good for (T, Z) . Then 



+ 1. 



Proof. Consider a path P of length h{T-L) from the root to a leaf u. Let w be the parent 
(i.e., the neighbour) of u in G%, and note that sh(i? lu ) > Z . Note also that in every two 
steps backwards along P, at least one of the dimensions of the corresponding rectangle 
increases by a factor of at least 1 + T. Hence one of the dimensions goes up by this factor 
at least (h(Ji) — l)/4 times (on the path from w to the root), and so 

Z (l + T )W«)-i)/4 ^ 

The result follows by rearranging and using the inequality log(l + T) ^ T/2, which is 
valid for all T 6 (0, 1). □ 

The following key lemma about hierarchies was proved in [31] . Although our definition 
of hierarchy is slightly different, the proof in our case is almost identical. 

Lemma 6 (Proposition 32 of [31]). Let Z > 1 > T > 0, let R be a rectangle, and suppose 
A internally spans R. Then there exists a hierarchy H for R, which is good for (T, Z), 
and which is satisfied by A. 

Sketch of proof. We use induction on <f>(R); if 4>{R) ^ Z then the result is trivial. Now, 
assume <f>(R) > Z and apply Proposition 30 of [3T] (see also [3] or [1]) repeatedly, each time 
choosing the rectangle S which minimizes d(S,R). We stop when we obtain a rectangle 
S such that either 4>(S) ^ Z, or d(S, R) ^ T. 

There are three cases. If 4>(S) ^ Z and d(S, R) ^ 2T, then Ji has two vertices. If 
4>(S) > Z and d(S, R) ^ 2T, the root of H has degree one, and the rest of H can be found 
by applying the induction hypothesis to S. 

So assume that d(S, R) > 2T, and consider the last application of Proposition 30 of |31j . 
We deduce that there exist rectangles S' and U with [S U S'] = U, with d(U, R) ^ T, 
4>{U) > Z and d(S', R) > 2T, and such that S and S' are disjointly internally spanned 
by A. But d(S, U) ^ d(S, R) — d(U, R) ^ T, and similarly for S'. Thus, applying the 
induction hypothesis to S and S', we obtain a hierarchy % as required. □ 

Finally, recall the following fundamental lemma of van den Berg and Kesten [T2] . 

The van den Berg— Kesten Lemma. Let A and B be any two increasing events, and 
letpe (0, 1). Then 

F p (AoB) ^ ¥ P (A)F P (B). 

We can now easily deduce, as in Section 10 of [3T], our basic bound on the probability 
of percolation. Given a rectangle R and a pair (T, Z) G M 2 , we write H(R,T, Z) for the 
collection of hierarchies for R which are good for (T, Z). 

Recall that P p (/(i2)) and F P {D(S, R)) denote the probabilities in P p of the events U R 
is internally spanned by A" and U R is internally spanned by A U S" respectively. 
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Lemma 7. Let R be a rectangle in [n] 2 , let Z > 1 > T > 0, and let p > 0. Then 
P p ([Anft] = ft) ^ J2 ( II ^p(d(Rv,Ru)) ) ( J] P P (/« 

HeH{R,T,Z) \N£ («)={«} / V seeds u 

(Above and in subsequent usage, the first product is over all pairs of vertices (u, v) of 
ft that satisfy the given condition Nq (u) = {v}, and the second product is over all seeds 
u of ft.) 

Proof of Lemma [7| By Lemma [HI if ^4 internally spans R then there exists a hierarchy in 
ft(ft, T, Z) which is satisfied by A. Hence the probability that A internally spans R is 
bounded above by the expected number of such hierarchies. Since the events I(R U ) and 
DyRw, R u ) are all monotone, and all occur disjointly (see (e) and (/) above), the result 
follows by the van den Berg-Kesten Lemma. □ 

We recall the following lemma of Aizenman and Lebowitz [2] , which is a standard tool 
for proving lower bounds onp c ([n] d ,2). 

Lemma 8. Suppose A internally spans [n] 2 . Then, for all 1 ^ L ^ n, there exists a 
rectangle R, internally spanned by A, with 

L ^ lg(ft) ^ 2L. 

We recall also the following bound on F p (D(R, ft')) from |29j . 

Lemma 9 (Lemma 5 of [29|). Let ft C ft' be rectangles of dimensions (a, b) and (a+s, b+t) 
respectively, and letp>0. Then 

F p (D(R, ft')) ^ exp ( - sgibq) - tg{aq) + 2(g(bq) + g(aq)) + {qst)e 2 ^ +2 ^) . 

The following observation follows exactly as in Lemma 10 of |29j . 

Observation 10 (Lemma 10 of [2H])- Let B > be sufficiently large, and let a G N and 

q > satisfy a ^ B/q. Then 

e 2g(aq) ^ 

aq 

Proof. Let z > and u = 1 — e~ z , and recall that e~ 9 ^ = /3(u). Recall also that 
f3(u) ~ \fz when z — > and that j3(u) — > 1 as z — > oo. Thus, since ft > is sufficiently 
large, it follows that 

for every z ^ ft, as required. □ 
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We shall need a couple more definitions in order to rewrite Lemmas [7] and M in a more 
useful form. Given a, b G M 2 , with a < b, let 



W g (a,b) := inf / (g(y)dx + g(x)dy), 



where the infimum is taken over all piecewise linear, increasing paths from a to b in M 2 
(see Section 6 of [31]). Now, for any two rectangles R C R', and given p > 0, define 

U(R,R') = W g (qdim(R),qdim(R')). 

The following observation is immediate from the definition. 

Observation 11 (Proposition 13 of [31]). Let R C R' be rectangles of dimensions (a,b) 
and (a + s,b + t) respectively, and let p > 0. Then 

sg(bq)+tg(aq) > -U{R,R'). 

Let N(J{) denote the number of vertices in a hierarchy H, and M(7i) denote the number 
of vertices of % which have outdegree two. The following technical lemma was proved in 
Again, although our definition is slightly different, the proof is identical. 



Lemma 12 (Lemma 37 of [3 lj ) . Let T, Z G M + , let H be a hierarchy for the rectangle R 
which is good for the pair (T, Z), and let p > 0. There exists a rectangle S C R, called 
the 'pod' ofH, such that 

dim(S) ^ dim (Ru) 

seeds u 

and 

U(R W ,R V ) > U(S,R) - 2qM{-H)g{Zq). 

N£ n (v)={w} 

We remark that although the rectangle S is not necessarily unique, Lemma [12] allows 
us to select such a rectangle S(7i) for each good hierarchy % G HlR, T, Z). We shall refer 
to this rectangle as 'the pod of W. 

We shall use the following observation to bound U(S,R) from below, and again later 
in the proof of Theorem [T] 

Observation 13. There exists C > such that, for every < a < oo, we have 

g(z)dz ^ ^log U + H + Ca. 

Proof. Let e > be such that Observation H] holds when z ^ e. Then, if a ^ e we have 

f a 1 f a a 1 

/ g(z)dz ^ - / — log z + 2z dz ^ -log - + a + a 2 , 
Jo 2 Jo 2 a 
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as required. Moreover, since g is decreasing, we have 

g(z)dz < ag(e) 



and so the observation follows, since if a > e then J Q a g(z) dz ^ 1 + ag(e) = 0(a). □ 

Finally, we shall use the following lemma, which follows from Lemma 16 of [31] (see 
also Lemma 7 of |29j). 

Lemma 14. Let q > and S C R, with dim(S') = (a, b) and dim(i?) = (A, B), where 
A^B. Ifb^ A, then 

-U(S,R) > ll Aq 9(z)dz+{B-A)g(Aq)-^log(l + ^^ -0(0(5)). 
Ifb>A, then 

-U(S,R) > (A-a)g(bq) + (B-b)g(Aq). 
q 



Proof. Assume first that b ^ A. By Lemma 16 of [31J, the path integral is minimized by 
paths which follow the main diagonal as closely as possible. Assuming for simplicity that 
a ^ b, by following the piecewise linear path (aq, bq) (bq, bq) — > (Aq, Aq) — > (Aq, Bq) 
we obtain 

1 2 f Aq 

-U(S,R) > (b-a)g(bq) + - / g(z) dz + (B-A)g(Aq). 

1 1 Jbq 

Now, by Observation [TBI we have 

-^j h \(z)dz ^ &log(l + i) + 0(b), 

and by Observation H] we have g(bq) ^ ^log(l + l/bq) — 0(1). (Note that inequality is 
trivial if bq is not sufficiently small.) Hence 

2 f bq a + b { \\ 

(b - a)g(bq) --J g(z) dz > —I- log ( 1 + -J - 0(b), 

as required. The inequality for b > A can be obtained by following the path (aq, bq) — > 
(Aq, bq) — > (Aq, Bq), and applying Lemma 16 of (3JJ. □ 

4. The proof of Theorem CD 

In this section we shall put together the pieces and prove Theorem [TJ Recall that, given 
p > 0, we define q = — log(l — p) ~ p as p — > 0. 

Proposition 15. Let C > and e > be constants, let p = p(C,e) > be sufficiently 
small, and let R be a rectangle with dimensions (a,b), where 

e C ( 1 

-^a^6^— log [ - 
q q 
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Then 

F p ([AnR]=R) < exp (- 2 - g{z)dz + (b - a)g{aq) + (tog ^ 

We remark that the constant implicit in the Oc (1) term depends on the constant C, 
but not on the variables p, a and b (and also not on the constant e). 

We begin by defining some of the parameters we shall use. First, set B = Clog(l/q), 
so that a ^ b ^ B/q, set T = yjq, and set 

1 (. l^ 3 



Z = — log- 

Let S* = S'('H) denote the pod of a hierarchy H, given by Lemma [T2l 

Lemma 16. Let C,e,p > 0, a, b G N and the rectangle R be as in the statement of 
Proposition [731 and let B, T and Z be as defined above. Then 



P p (/(#)) < £ exp 

HeH(R,T,Z) 



-C/(5,i2) +O c iV(H) log 
5 V V 9 



P,(/(i2»)). 



seeds u 



Proof. First note that by Observation [TT1 and Lemma [T2| the pod <S = S(H) C R oi H 
satisfies 

^ s^fog) + ttffog) ^ - ^ U(R V , R u ) ^ ~U(S, R) - 2M(H)g(Zq), 
^g w K)=M 9 ^ («)={«} g 

where (a^, 6j) and (<Xj + Sj, &j + £j) are the dimensions of R Vi and respectively. 

Now, by the definition of a hierarchy, we have s, 2T(<2j + s$) ^ 3Taj, and similarly 
^ 3Tbi, for every pair Uj) with Nq (ui) = {vi}. Recall that g(z) is decreasing, so 

max{g(Zq),g(aiq),g(biq)} ^ g-(q-) < log-, 

by Observation H] (applied with z = q). Recall also that Oj, h ^b ^ B/q. 

By Observation [10], it follows that sie 29 ^^ < ABsi/aiq ^ 12BT/q, and similarly 
£. e 29(M) ^ \2BTjq. Thus 



<7(Z ? ) + 2^(a,g) + 2^(&^) + (g S ^)e 2 ^ +2 ^ ^ 51ogi + O ^ T 
and hence, since T 2 = q, B = O c (k)g(l/g)) and M(7£) < iV(ft), 

2M{U)g{Zq) + (2y(6g) + 2#(ag) + (gst)e 25(fe9)+25(ag) ) = O c ( ("log i 

jv^ • (u)={*} V ^ q ' 
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Hence, by Lemma we have 

1 



U(S,R) +O c [N(H) log 



11 ¥ P (D(R V ,R U )) < exp 

Ng u («)={«} 

and so the lemma follows by Lemma [7J □ 




We can now deduce Proposition [TS] from Lemma [HJ The main difficulty lies in the 
fact that there are too many hierarchies: there could be as many as vertices in G^, 
and for each vertex u we have many choices for the rectangle R u . However, most of these 
hierarchies have many seeds, and those with many large seeds have rather small weight 
in the sum. This turns out to be the key idea in the proof. 

Indeed, given a hierarchy of R which is good for (T, Z), define a large seed to be one 
with 4>(Ru) ^ Z/3. We make the following key observation. 

Observation 17. Let % G "H(-R, T, Z), and assume that AT ^ 1 and Z ^ 6. Then every 
vertex ofH is either a seed, or lies above at least one large seed. 

Proof. By the definition of a good hierarchy, either it is a seed, or <f)(R u ) > Z and either 
Nq (it) = {v} or Nq (u) = {v, w}. In the former case we have 

<P(R V ) ^ (l-2T)(f>(R u ) > |, 
since 4T ^ 1. In the latter case, we have <fi(R v ) + 4>(R W ) ^ <P(Ru) — 2, and so 

m&x{<f)(R v ),<f)(R w )} ^ — , 
as required. □ 
Let the number of large seeds in a hierarchy "H be denoted m("H). 

Proof of Proposition [731 Let R C [n] 2 , p > 0, B = Clog(l/g), T = ^/q, and Z = 
(1/y/g) ( log -) 3 be as described above, and suppose that H is a hierarchy for R which 
is good for the pair (T,Z). Recall that p > is chosen sufficiently small, and that 
a ^ b ^ B/q. By Lemma [U "H has height at most (10/y/g) log(l/g), and hence the 
number of vertices N{TT) in G% satisfies 

N{U) ^ 2m(H) ■ h(H) = O ( log -) . (2) 
Therefore, the number of hierarchies with m large seeds (i.e., with m("H) = m) is at most 



(3) 
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Now, for each hierarchy f-L, define 

X(H) := J2 



seeds u 



m(T-L)Z 

and note that X(H) ~£ v ; , and that $(S{U)) ^ X(H), by Lemma[H By LemmaEJ 

3 

for every seed R u we have 

Vp(l(Ru)) < (-<f>(Ru)9{Zq)] 
since sh(i? u ) ^ Z = o(l/q) as q — > 0, and (7(2;) is decreasing in z. Thus 

J] P p (/(K)) < 3 X ™ exp ( - X{U)g{Zq)) . (4) 

seeds u 

We split into two cases. The first is easier to handle, and we shall not have to approximate 
too carefully; in the second the calculation is much tighter. 

Case 1: lg(5) > a. 

We have, by Lemma [T6l combined with ([2]) and (j3J), 



p p (i(r)) ^ Yl sX{n) 

exp 

HeH(R,T,Z) 



-U(S, R) - X{U)g{Zq) + Oc ( ^ flog - 



Recall that a < 4>(S) < X{H), by Lemma [HI and so -U(S,R) > (b - X(H))g(aq), by 
Lemma O Hence the summand above is at most 



3*(«) 



exp 



^(ag) - X(H)(g(Zq) - g(aqj) + O c f ^ (log ^ 



Now, since g is decreasing, X(7i) ^ m(H)Z/3 and a/Z ^ g 1 / 3 , by Observation H] we 
have 

as q — > 0. It follows that 
F P (I(R)) < ^exp (-fe^(ag) - ^1 i og Ij ^ exp (-^J^ g ( z )dz - bg(aq)^ , 

as required. Indeed, we showed that X(H) [g[Zq) —g(aq)) is at least ^y^- log -, and much 
bigger than ^p(logi) 3 , so the first inequality holds. For the last inequality, first note 



that 



- [ C 'g(z)dz ^ alog(l + — J + 0(a) = o[X(H)log- 
Q Jo V a lJ V 1 
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as q — > 0, by Observation [T51 Here we used the facts that aq ^ e and a < X(H). Finally, 
note that, by ([3]), there are at most e x hierarchies with X(H) = X. Hence we obtain a 
geometrically decreasing sum over X, and the claimed bound follows. 



Case 2: lg(S) ^ a. 

By Lemma [HI and since 4>{S) ^ X("H), we have 



X{H) 



2 r aq 

-U(S,R) > - g(z)dz + (b-a)g(aq) - log ( 1 

1 1 Jo 



Hence, by Q2) 



P P (/(/?)) < Yl ex P 

HeH(R,T,Z) 



and Lemma [T6| we have 

g(z) dz - (b- a)g(aq) 
m{%) 



2 

q Jo 



X(H)q 
X{U) 



O (*(«))■ 



log 1 



+ 0(X(H)) + o 



c 



log 



x(n) q 

X{H)g{Zq) 



But by Observation HI 



X(-H) -log 1 



X(H)q 



X(W) 



where C\ 



3 2ci+l 



Note that, for any u, v > 0, the function xlog ( ^b; ) is increasing 

xn(l-C]Z 

if x ^ ux + v, and recall that X(J-L) ^ . Thus, the right-hand side above is 

3 

decreasing in X(l-i) if m{l-L) is sufficiently large, and hence either m(7-L) = 0(1), or 



X(H) 



log 



X{H) 



CiZ(l + X(^)g) 



m{H)Z fm{H) 



6 



log 



4d 



and m(^-i)Zq ^ 1, or the left-hand side is at most 
Finally, recalling that Z = -^(log(l/g)) 3 , we have 



m(H)Z 



log(l/g). 



m{H)Z fm{H) 



6 



log 



4d 



O 



c 



m{H) 



log- 

q 



Oc(l) 

Vq 



log- 

9 



m(H) 



log- 
9 



since either m("H) = Oc(l), or the first (negative) term dominates. 
Putting these various bounds together gives 



F P (I(R)) ^ ex P 

■H£H(R.,T,Z) 



2 

Q Jo 



aq 

g{z)dz — (b — ajg(aq) + 



O c (l)-m(H) 



log- 
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Hence, using 03]), and summing over m("H), we obtain 



P P (/(#)) < exp 



lj\ {z)iz - {h - a)g{m) + 9c^^ 



as required. □ 

Before deducing Theorem [1] from Proposition [151 we need to recall the following fact 
from [31], and to make an easy observation. 

Lemma 18 (Proposition 5 of [31]). 

I » {z)dZ = 18' 
The following observation follows almost immediately from Lemma [TBI 

Observation 19. Let p > be sufficiently small, and let a, b G M. + , with a ^ b and 
b ^ B/2p, where B = 101og(l/p). Then 

2 f aq 2A 

- / g(z)dz + (b-a)g(aq) > 1, 

where X = 7r 2 /18. 

Proof. If a ^ B/4p, then this follows since g(z)dz = 0(g(aq)), uniformly over a G 
(0, oo), and so 

(b-a)g(aq) - - H ' g(z) dz > { f] g{aq) - O ( g -M\ > . 



Q Jag \*P 

If a ^ B/4p then it holds because g(z) ^ 2e~ z for z large, and so 

2 f°° 4 4 

- / #(z)cfe ^ -e~ aq ^ -e~ B/5 ^ 1, 

qJag Q q 

as required. □ 

Finally, we deduce Theorem [1] from Proposition [T5l 

Proof of TheoremUi Let C*2 > be a large constant to be chosen later, let n G N be 
sufficiently large, and let 

7r 2 C 2 (loglogn) 3 

p = — — -, — — . 

18 log n (logn) 3 / 2 

Note that q = — log(l — p) < p + p 2 , and so q also satisfies this equality (with a slightly 
different constant C 2 ). 

Let the elements of A C [n] 2 be chosen independently at random, each with probability 
p, and suppose that A percolates. Then, by Lemma [83 there exists a rectangle R C [ra] 2 , 
which is internally spanned by A, and with B/2p ^ lg(-R) =S5 -B/p, where B = 101og(l/p). 
Let dim(i?) = (a, b), and assume without loss of generality that a ^ b. There are at most 
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n 2 (B/p) 2 potential such rectangles, and each is internally spanned with probability at 
most 

/ \ / 1 2 f aq 1 0(1) / 1\ 3N 

F p ([AnR] =R) s= exp I - |- ^ g{z)dz + (b - a)g(aq)^ + ^log-j 

if sh(i?) ^ 1/q, by Proposition [T5| and with probability at most 



e -bg(aq) ^ g -B/40p _ ^l/Ap ^ . _ 



^ \ 100 



11 



if a = sh(i?) ^ 1/q and n is sufficiently large, since g(aq) ^ g(l) = — log/3(^-) > 1/20 
and fe ^ B/2p. Note that, since we apply Proposition [L31 with C = 10, we obtain an 
absolute constant 0(1) in the expression above. 
By Observation [19| we have 

2 r aq 2A 

- / g{z)dz + (b-a)g(aq) ^ 1, 

q Jo q 

where A = 7r 2 /18. Thus, using the identity = ~ + , ^_ ) ; this gives, as n — > oo, 




P p ([A] = [n] 2 ) < n 2 (5/p) 2 exp 

^ ra 2 (i?/p) 2 exp ( — 21ogn — ^-(log log n) 3 \/logn + ^iil (log - 

\ A Vq \ q 

^ n 2 (log n) 3 exp ^ — 21ogn — (log log n) 3 ^/log n^j — > 
if C2 is sufficiently large, as required. □ 

5. Extensions and open questions 

In this paper we have studied bootstrap percolation on one particular graph, the two- 
dimensional grid with nearest-neighbour bonds. It is natural to ask whether our method 
can be applied to bootstrap percolation on other graphs; here we shall discuss two such 
possible generalizations. 

The most obvious (and most extensively studied) generalization is to consider bootstrap 
percolation in d dimensions (i.e., on the graph [n] d ), with nearest neighbour interaction 
and threshold 2 ^ r ^ d (as studied in, for example, [2j El El [161 13 SO])- The sharp 
metastability thresholds for these models (with d fixed, and as n — > 00) were determined 
in [8], and it is likely that the methods of this paper (and those of [28]) could be adapted 
to give improved bounds in the case of r = 2 and general d. 

Problem 1. Bound the second term in the asymptotic expansion ofp c ([n] d , 2) as n — > 00. 
The case r ^ 3 is more complicated, and the following problem is likely to be difficult. 
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Problem 2. Bound the second term in the asymptotic expansion ofp c ([n} 3 , 3) as n — > oo. 

We remark that in very high dimensions (d 3> logn, say) new ideas are required, and 
much less is known in general. However, results analogous to Theorem [1] have been proved 
in the special cases r = 2 and r = d, see [SI [7J. 

A second natural generalization is to consider bootstrap percolation in two dimen- 
sions, but with a different update rule. For example, in the 'modified' bootstrap process 
(see [32]), a vertex is infected if at least one of its neighbours in each dimension is already 
infected; in the '/c-cross' process (see [331 f!5] ). a vertex v is infected if at least k vertices 
in the cross-shaped set 

|J {v + (0,j),v + (j,0)} 

0^e[-fe+l,fc-l] 

are previously infected; and in the Frobose process (introduced by Frobose [25] in 1989) a 
site of [n] 2 is infected if it has one already- infected neighbour in each dimension, along with 
the next-nearest neighbour in the corner between them. In general, one could consider 
an arbitrary neighbourhood N(v) of each vertex v, an arbitrary (monotone) family A(v) 
of subsets of N(v), and say that v becomes infected if the already-infected subset of its 
neighbours is in A(v). 

Holroyd [3T] (see also [32]) determined the sharp threshold for the modified and Frobose 
models, and Holroyd, Liggett and Romik [33] did so for the /c-cross process for all fixed 
fceN. Moreover, Duminil-Copin and Holroyd [22] have recently shown, for a large family 
of such models (including all of the examples above, and other similar models), that there 
exists a sharp metastability threshold. It is not unreasonable to hope that our method 
(together with that of [28]) might yield improved bounds on the critical probability for a 
more general collection of bootstrap processes, of the type considered in [22] . Indeed, for 
two of the processes described above this is the case. 

Let pT\[ti] 2 ) denote the critical probability for percolation in the Frobose process on 
[n] 2 , and let pc + ^([n] 2 , k) denote the critical probability for percolation in the fc-cross pro- 
cess. The upper bounds in the following theorem were proved by Gravner and Holroyd [28] 
(for the Frobose model) and by Bringmann and Mahlburg [J5] (for the fc-cross process). 
The lower bounds follow by the methods of this paper. 



Theorem 20. 



p! f) (N 2 ) 



6 logn (logn) 3 / 24 " ' 1 ) 
as n — > oo. Let fceN, and let Xk = n 2 /3k(k + 1). Then 

n (+)(\n] 2 k) = — 1 

Pc IFJ ■ • lQgn ( logn )3/2+o(l) 

as n — > oo. 

In fact the bounds we prove (and those from [281 US]) are a little stronger than those 
stated above; they are like the bounds in Theorem [TJ 
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Sketch of proof of Theorem\W^ For the first part, it suffices to show that (in the Frobose 
process) on R = [m] x [n], all spanning sets have size at least m + n — 1. The result then 
follows in exactly the same way as Theorem [TJ Indeed, simply replace the function g by 
the function 

h(z) = -log(l-e- z ), 

and note that a rectangle is crossed if, and only if, it has no empty column. The rest of the 
proof carries over essentially verbatim, the key point being that J* h(z) dz ~ xlog(l/x) 
when x — > 0, and F p (I(R u )) ~ exp ( — <f)(R u )h(aq)) when u is a seed, so the corresponding 
terms in the final calculation are of the same order. 

We shall give two proofs that if [A] = R then |A| ^ m + n — 1. The first is standard, 
using Proposition 30 of [31] and induction on <f)(R) (see Lemma 7 of [1] or Problem 35 
of [H]). For the second, consider the (bipartite) graph G whose vertices are the rows and 
columns of R, with an edge from row x to column y if and only if (x, y) G A. 

To prove that G has at least m + n — 1 edges, we shall show that it is connected. 
Indeed, if G is not connected then exists a set of rows X and a set of columns Y such 
that A C S = (X n Y) U (X c n Y c ). But then [S] = S ^ R, so A does not percolate, as 
required. 

For the second part, we need the following idea from [33]: first couple the fc-cross process 
with an 'enhanced process' (see [33], Section 5) in which the closed sets are rectangles. 
In the enhanced process the minimum number of sites required to infect an [m] x [n] 
rectangle is about (m + n)/k, which is also the typical number required. (To prove this, 
apply the standard proof, by induction on m + n.) 

The result now follows by the proof of Theorem [H replacing the function g by the 
function — log/(e _z ), where / : [0, 1] — > [0, 1] is decreasing and satisfies 

f k _ f k+i = x k _ x k+i^ 

and noting that if a rectangle is crossed in the enhanced process, then it has no £ A;-gap' 
of k successive empty columns (see Lemma 12 of |33j). 

We obtain a sufficiently strong bound on ¥ P (I(R U )), where u is a seed, using the proof 
of Lemma |2] this works because our lower bound on \A D R\ is also the typical size of 
a percolating set in R when 4>(R) 1/p. It follows that the contribution of the large 
seeds to the final calculation is of the same order as that of the integral J Q X — log f(e~ z ) dz, 
where x is the semi-perimeter of the pod. Modulo a little basic analysis, the rest of the 
proof works as above; we leave the details to the reader. □ 

Gravner and Holroyd [28] also improved the upper bounds for the modified process. 
However, the proof of Theorem [T] does not work for the modified process, since we do not 
have a result analogous to Lemma [21 In particular, it is possible to internally span an 
m x n rectangle with max{m, n} infected sites, but the proportion of such minimal-size 
sets which percolate is very small. 

Let pc ([n] d ) denote the critical probability for percolation in the modified bootstrap 
process on the graph [n] d , i.e., the infimum over p such that the probability of percolation 
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is at least 1/2. We have the following conjecture; it is the analogue of Conjecture [T] for 
the modified process. 

Conjecture 2. As n — > oo, 

(M) ' ' ^- 71-2 



61ogn (log n) 3 /^ 1 ) ' 

Given a rectangle i?, we say that a set A C R is a minimal percolating set if A spans 
i?, but no proper subset of A does so (see [37], for example). Given m ^ n and x ^ 0, 
let F(m,n,x) denote the number of minimal percolating sets of size m + x in modified 
bootstrap percolation on R = [m] x [n]. We remark that Conjecture [2] would follow from 
the method of this paper, together with following bound: 

F{m,n,x) ^ n m-n+2x+o(n) _ 

Note that even if we restrict ourselves to 'threshold' models, in which a vertex is infected 
if at least r elements of its neighbourhood are infected, we still run into similar problems. 
Indeed, consider the model in which a vertex is infected if at least four of its eight neigh- 
bours (including diagonals) are infected. A typical seed R is shaped like an octagon, and 
the number of infected sites used to fill R (in the random process) is roughly <f)(R) (which 
we define to be the number of external vertices plus the number of external edges), while 
the minimal number required to span R is only <fr(R)/2. 

Finally, returning to the standard bootstrap process, recall that Theorem [TJ determines 
the second term of p c ([n] 2 ,2) up to a poly(loglogn)-factor. We ask whether this error 
term can be removed. 

Problem 3. Determine a G [0,3], if it exists, such that 

v (\n? 2) - - ( lQ g lQ g n ) a+Q(1) 

Pc{[n\ ,t) 181ogn ( logn )3/2 

as n — > oo. 

As usual in bootstrap percolation, it would not be unreasonable to suspect that the 
upper bound in Theorem [1] is closer to the truth. 
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